Theory and practice of ambiguity labelling with a view to interactive disambiguation in text and speech MT
Abstract
In many contexts, automatic analyzers cannot fully and reliably disambiguate a sentence or an utterance, but can produce ambiguous results that contain the correct interpretation. It is useful to study various properties of these ambiguities with a view to subsequent total or partial interactive disambiguation. We have proposed a technique for labelling ambiguities in texts and in dialogue transcriptions, and tested it on multilingual data. It was first necessary to define formally the very notion of ambiguity relative to a representation system, as well as associated concepts such as ambiguity kernel, ambiguity scope, and ambiguity occurrence.
Similar resources
Interactive Real-time Translation via the Internet
The Internet offers a tremendous opportunity for experiments with real-time machine translation (MT) dialogues. Interchanges may be typed, or speech recognition (SR) techniques might be used to accept spoken input. In either case, given the number of ambiguities still escaping automatic resolution, interactive disambiguation for both components seems likely to remain important in many Internet ...
A Context-Sensitive Homograph Disambiguation in Thai Text-to-Speech Synthesis
Homograph ambiguity is a long-standing issue in Text-to-Speech (TTS). To disambiguate homographs, several efficient approaches have been proposed, such as part-of-speech (POS) n-gram, Bayesian classifier, decision tree, and Bayesian-hybrid approaches. These methods need the words and/or POS tags surrounding the homographs in question for disambiguation. Some languages such as Thai, Chinese, and Japanese have...
An Interactive Disambiguation Module for English Natural Language Utterances
An interactive disambiguation methodology has been proposed and implemented at the GETA lab in the framework of Dialogue-Based Machine Translation. This methodology has been generalized and re-engineered at ATR-ITL in the framework of spoken language translation and the MIDDIM project, a joint research project between ATR-ITL and GETA-CNRS aimed at studying Multimodal Interactive Disambiguation. The...
Part of Speech Tagger for Kannada
Part-of-speech tagging is a well-understood problem in NLP. Its importance stems from the fact that part-of-speech tagging is one of the first stages performed by various natural-language-related processes. POS tagging is the process of assigning the part-of-speech tag or other lexical class marker to each and every word in a sentence. POS tagging has a cruci...
Comparison of Point of View based on Gérard Genette’s Focalization theory in Sophocles’ Ajax and Faulkner’s The Sound and the Fury
Sophocles’ Ajax is among the most significant plays in ancient Greek literature, and William Faulkner’s The Sound and the Fury is an important modern novel. This study considers these two works using Gérard Genette’s Focalization term in Narratology theory. Focalization is a feature of the point of view in narrative and is described by characters’ action, emotional reaction and speech. This stu...
Publication date: 1996